Abstract — Images obtained from coherent illumination processes are
contaminated with speckle noise, with polarimetric synthetic aperture
radar (PolSAR) imagery as a prominent example. With an adequacy widely
attested in the literature, the scaled complex Wishart distribution is
an acceptable model for PolSAR data. In this perspective, we derive
analytic expressions for the Shannon, Rényi, and restricted Tsallis
entropies under this model. Relationships between the derived measures
and the parameters of the scaled Wishart law (i.e., the equivalent
number of looks and the covariance matrix) are discussed. In addition,
we obtain the asymptotic variances of the Shannon and Rényi entropies
when replacing distribution parameters by maximum likelihood
estimators. As a consequence, confidence intervals based on these two
entropies are also derived and proposed as new ways of capturing
contrast. New hypothesis tests are additionally proposed using these
results, and their performance is assessed using simulated and real
data. In general terms, the test based on the Shannon entropy
outperforms those based on Rényi's.

Index Terms — Information theory, SAR polarimetry, contrast 
measures. 



I. Introduction 

POLARIMETRIC synthetic aperture radar (PolSAR) has been used to
describe earth surface phenomena. This technology uses coherent
illumination, which causes the interference pattern called 'speckle'
[2]. Speckle is multiplicative by nature and, in the format considered
here, non-Gaussian. This fact precludes the use of conventional tools
in PolSAR image analysis, requiring specialized techniques.

The scaled complex Wishart distribution has been successfully employed
as a statistical model for homogeneous regions in PolSAR images. This
law is at the core of segmentation [3], classification [4], and
boundary detection [5] techniques.

The concepts of "information" and "entropy" were given formal
mathematical definitions in the context of data communications by
Shannon in 1948 [6]. Thenceforth, the proposition and application of
information and entropy measures have become an active research field
in several areas. Zografos and Nadarajah derived closed expressions for
Shannon and Rényi entropies for several univariate [7], bivariate [8],
and multivariate [9] distributions.

Exploring relationships associated with the log-likelihood function,
Zong [10] applied the Rényi entropy to several univariate
distributions. In fact, this entropy measure has been applied to image
processing problems such as data mining, detection, segmentation, and
classification [11]-[13]. Another prominent entropy measure is the
restricted Tsallis entropy. This tool was introduced by Tsallis in
[14], [15] and is related to the Rényi entropy. The restricted Tsallis
entropy has found applications in statistical physics [16].

Among these information theoretical tools, the Shannon entropy has been
applied to PolSAR imagery. Morio et al. [17] analyzed such entropy for
the characterization of polarimetric targets under the complex,
circular, and multidimensional Gaussian distribution.

Stochastic distances are also derived within the framework of
information theory. A comprehensive examination of these measures is
presented and applied to intensity SAR data in [18], [19], and to
PolSAR models in [20].

In this paper, we derive analytic expressions for the Shannon, Rényi,
and restricted Tsallis entropies under the scaled complex Wishart law.
These measures are analyzed as particular cases of the
$(h, \phi)$-entropy proposed by Salicrú et al. [21]. When parameters
are replaced by maximum likelihood estimators, these entropies become
random variables; expressions for the asymptotic variances of the
Shannon and Rényi entropies are derived (the variance of the Tsallis
entropy is analytically intractable and, thus, is not considered
further).

Novel methodologies for testing hypotheses and constructing confidence
intervals are proposed for quantifying contrast in PolSAR imagery using
these results. Such measures can be used in PolSAR segmentation [3],
classification [4], boundary detection [22], [23], and change detection
[24]. Monte Carlo experiments are performed for assessing the
performance of the discussed measures in synthetic data, and an
application to real PolSAR data is performed.

The remainder of this paper is organized as follows. Section II recalls
the scaled Wishart law. Selected information theoretic tools are
summarized in Section III. Section IV presents the proposed entropies
and the asymptotic variance of their estimators, along with an
application. Section VI concludes the paper.

II. The Complex Wishart distribution 

Full-polarimetric SAR sensors record the complex scattering coefficient
for the four combinations of the received and transmitted complex
linear polarizations: $S_{HH}$ (horizontal-horizontal), $S_{HV}$
(horizontal-vertical), $S_{VH}$ (vertical-horizontal), and $S_{VV}$
(vertical-vertical). When natural targets are considered, the
conditions of the reciprocity theorem [2], [25], [26] are satisfied and
it can be assumed that $S_{HV} = S_{VH}$.

In general, we may consider systems with $m$ polarization elements,
which constitute a complex random vector denoted by:

$$y = [S_1\ S_2\ \cdots\ S_m]^{T}, \tag{1}$$

where $(\cdot)^{T}$ is the transposition operator. It is commonly
assumed that the scattering vector $y$ follows a circular complex
Gaussian law [27].

Multilook PolSAR data are usually formed in order to enhance the
signal-to-noise ratio (SNR):

$$Z = \frac{1}{L} \sum_{i=1}^{L} y_i y_i^{H},$$

where $(\cdot)^{H}$ is the Hermitian operator, $y_i$ represents the
scattering vector in the $i$th look, and $L$ is the number of looks, a
parameter related to the noise effect in SAR imagery. Matrix $Z$ is
defined over the set of positive-definite Hermitian matrices
$\mathcal{A}$. Moreover, Goodman showed that $LZ$ follows the ordinary
complex Wishart law [28] and, therefore, the density of the scaled
random matrix $Z$ is

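$$f_Z(Z'; \Sigma, L) = \frac{L^{mL} |Z'|^{L-m}}{|\Sigma|^{L}\, \Gamma_m(L)} \exp\left[-L \operatorname{tr}(\Sigma^{-1} Z')\right], \tag{2}$$

where $m$ is the order of $Z$, $m < L$,
$\Gamma_m(L) = \pi^{m(m-1)/2} \prod_{k=0}^{m-1} \Gamma(L - k)$,
$\Gamma(\cdot)$ is the gamma function, $\Sigma = \mathrm{E}\{y y^{H}\}$,
and $\mathrm{E}\{\cdot\}$ is the expectation operator. This is denoted
as $Z \sim \mathcal{W}_m(\Sigma, L)$.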
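
The multilook formation described in this section can be sketched in a
few lines of Python. This is only an illustration under stated
assumptions, not the simulation procedure used in the paper: the number
of looks is an integer, the covariance matrix is the identity, and the
helper names `circular_gaussian_vector` and `multilook` are
hypothetical.

```python
import random


def circular_gaussian_vector(m, rng):
    """Draw m independent circular complex Gaussian entries (unit variance)."""
    s = 0.5 ** 0.5  # real and imaginary parts each carry half the variance
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(m)]


def multilook(vectors):
    """Average the outer products y_i y_i^H over all looks."""
    L = len(vectors)
    m = len(vectors[0])
    Z = [[0j] * m for _ in range(m)]
    for y in vectors:
        for a in range(m):
            for b in range(m):
                Z[a][b] += y[a] * y[b].conjugate()
    return [[entry / L for entry in row] for row in Z]


rng = random.Random(0)
# ASSUMPTION: m = 3 channels, L = 4 looks, identity covariance matrix.
looks = [circular_gaussian_vector(3, rng) for _ in range(4)]
Z = multilook(looks)
```

The resulting `Z` is Hermitian with a real, positive diagonal, as
expected of a multilook covariance matrix.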

III. Information Theory 

Information theory provides important tools for statistical inference
[29], data compression [30], and image processing [11], to name a few
applications. In particular, entropy is a fundamental concept related
to the notion of disorder in statistical mechanics [31]. Salicrú et al.
[21] proposed the $(h, \phi)$-entropy class, which generalizes the
original concept. In the following, we recall this definition and we
derive entropies for positive-definite Hermitian random matrices.

Let $f_Z(Z'; \theta)$ be a probability density function with parameter
vector $\theta$ which characterizes the distribution of the random
matrix $Z$. The $(h, \phi)$-entropy relative to $Z$ is defined by

$$H^h_\phi(\theta) = h\left(\int_{\mathcal{A}} \phi\big(f_Z(Z'; \theta)\big)\, \mathrm{d}Z'\right),$$

where either $\phi\colon [0, \infty) \to \mathbb{R}$ is concave and
$h\colon \mathbb{R} \to \mathbb{R}$ is increasing, or $\phi$ is convex
and $h$ is decreasing. The differential element $\mathrm{d}Z'$ is given
by

$$\mathrm{d}Z' = \prod_{i=1}^{m} \mathrm{d}Z_{ii} \prod_{\substack{i,j=1 \\ i<j}}^{m} \mathrm{d}\Re\{Z_{ij}\}\, \mathrm{d}\Im\{Z_{ij}\},$$

where $Z_{ij}$ is the $(i, j)$-th entry of matrix $Z'$, and $\Re$ and
$\Im$ denote the real and imaginary parts, respectively [28]. Table I
shows the specification of $h$ and $\phi$ for the three entropies we
use in this article: Shannon, Rényi, and restricted Tsallis.

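The following result, derived by Pardo et al. [34], paves the way for
the proposal of asymptotic statistical inference methods based on
entropy.

Lemma 1: Let $\hat\theta = [\hat\theta_1\ \hat\theta_2\ \cdots\ \hat\theta_p]^{T}$
be the ML estimate of the parameter vector
$\theta = [\theta_1\ \theta_2\ \cdots\ \theta_p]^{T}$ based on a random
sample of size $N$ under the model $f(Z'; \theta)$. Then

$$\sqrt{N}\,\big[H^h_\phi(\hat\theta) - H^h_\phi(\theta)\big] \xrightarrow{\mathcal{D}} \mathcal{N}\big(0, \sigma_H^2(\theta)\big),$$
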

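TABLE I
$(h, \phi)$-ENTROPIES AND RELATED FUNCTIONS

$(h, \phi)$-entropy                                                        $h(y)$                   $\phi(x)$
Shannon [32]                                                               $y$                      $-x \ln x$
Restricted Tsallis (order $\beta \in \mathbb{R}_+ : \beta \neq 1$)         $y$                      $(x^\beta - x)/(1 - \beta)$
Rényi (order $\beta \in \mathbb{R}_+ : \beta \neq 1$) [33]                 $(\ln y)/(1 - \beta)$    $x^\beta$
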

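where $\mathcal{N}(\mu, \sigma^2)$ is the Gaussian distribution with
mean $\mu$ and variance $\sigma^2$, '$\xrightarrow{\mathcal{D}}$'
denotes convergence in distribution,

$$\sigma_H^2(\theta) = \delta^{T} \mathcal{K}(\theta)^{-1} \delta, \tag{3}$$

$\mathcal{K}(\theta) = \mathrm{E}\{-\partial^2 \ln f_Z(Z; \theta)/\partial\theta^2\}$
is the Fisher information matrix, and
$\delta = [\delta_1\ \delta_2\ \cdots\ \delta_p]^{T}$ such that
$\delta_i = \partial H^h_\phi(\theta)/\partial\theta_i$ for
$i = 1, 2, \ldots, p$.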

In the following we introduce a methodology for hypothesis 
tests and confidence intervals based on entropy. 

A. Hypothesis test 

Let $Z_i$ be a positive-definite random matrix with probability density
function defined over $\mathcal{A}$ with parameter vector $\theta_i$
for $i = 1, 2, \ldots, r$, where $r$ is the number of populations to be
assessed. We are interested in testing the following hypotheses:

$$\begin{cases} \mathcal{H}_0\colon H^h_\phi(\theta_1) = H^h_\phi(\theta_2) = \cdots = H^h_\phi(\theta_r) = v, \\ \mathcal{H}_1\colon H^h_\phi(\theta_i) \neq H^h_\phi(\theta_j) \text{ for some } i \text{ and } j. \end{cases}$$

In other words, statistical evidence is sought for assessing whether at
least one of the $r$ regions of a PolSAR image has different entropy
when compared to the remaining regions.

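Let $\hat\theta_i$ be the ML estimate for $\theta_i$ based on a random
sample of size $N_i$ under $Z_i$, for $i = 1, 2, \ldots, r$. From
Lemma 1 we have that, under $\mathcal{H}_0$,

$$\frac{\sqrt{N_i}\,\big[H^h_\phi(\hat\theta_i) - v\big]}{\sigma_H(\hat\theta_i)} \xrightarrow{\mathcal{D}} \mathcal{N}(0, 1)$$

for $i = 1, 2, \ldots, r$. Therefore,

$$\sum_{i=1}^{r} \frac{N_i \big[H^h_\phi(\hat\theta_i) - v\big]^2}{\sigma_H^2(\hat\theta_i)} \xrightarrow{\mathcal{D}} \chi^2_r. \tag{4}$$
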



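Since $v$ is, in practice, unknown, in the following we modify this
test statistic in order to take this into account. Considering an
application of Cochran's theorem [35], we obtain:

$$\sum_{i=1}^{r} \frac{N_i \big[H^h_\phi(\hat\theta_i) - v\big]^2}{\sigma_H^2(\hat\theta_i)} = \sum_{i=1}^{r} \frac{N_i \big[H^h_\phi(\hat\theta_i) - \bar{v}\big]^2}{\sigma_H^2(\hat\theta_i)} + \sum_{i=1}^{r} \frac{N_i (\bar{v} - v)^2}{\sigma_H^2(\hat\theta_i)}, \tag{5}$$

where

$$\bar{v} = \left[\sum_{i=1}^{r} \frac{N_i}{\sigma_H^2(\hat\theta_i)}\right]^{-1} \sum_{i=1}^{r} \frac{N_i H^h_\phi(\hat\theta_i)}{\sigma_H^2(\hat\theta_i)}.$$
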



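Salicrú et al. [21] showed that the second summation in the right-hand
side of Equation (5) is chi-square distributed with one degree of
freedom. Since the left-hand side of (5) is chi-square distributed with
$r$ degrees of freedom (cf. Equation (4)), we conclude that:

$$\sum_{i=1}^{r} \frac{N_i \big[H^h_\phi(\hat\theta_i) - \bar{v}\big]^2}{\sigma_H^2(\hat\theta_i)} \xrightarrow{\mathcal{D}} \chi^2_{r-1}.$$

In particular, consider the following test statistic:

$$S^h_\phi(\hat\theta_1, \hat\theta_2, \ldots, \hat\theta_r) = \sum_{i=1}^{r} \frac{N_i \big[H^h_\phi(\hat\theta_i) - \bar{v}\big]^2}{\sigma_H^2(\hat\theta_i)}. \tag{6}$$

We are now in the position to state the following result.

Proposition 1: Let $N_i$, $i = 1, 2, \ldots, r$, be sufficiently large.
If $S^h_\phi(\hat\theta_1, \hat\theta_2, \ldots, \hat\theta_r) = s$,
then the null hypothesis $\mathcal{H}_0$ can be rejected at a level
$\alpha$ if $\Pr(\chi^2_{r-1} > s) \leq \alpha$.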
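
The decision rule of Proposition 1 can be sketched as follows. This is
a minimal illustration, assuming the entropy estimates, their
asymptotic variances, and the sample sizes have already been computed;
the function names `chi2_sf` and `entropy_test` are hypothetical, and
the chi-square tail probability is obtained from a standard recurrence
in the degrees of freedom rather than from a statistics library.

```python
import math


def chi2_sf(x, df):
    """Pr(chi-square with integer df >= 1 exceeds x), by recurrence in df."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 1:
        q, k = math.erfc(math.sqrt(half)), 1   # closed form for df = 1
    else:
        q, k = math.exp(-half), 2              # closed form for df = 2
    while k < df:
        # Q(k + 2, x) = Q(k, x) + (x/2)^(k/2) exp(-x/2) / Gamma(k/2 + 1)
        q += math.exp((k / 2.0) * math.log(half) - half
                      - math.lgamma(k / 2.0 + 1.0))
        k += 2
    return q


def entropy_test(entropies, variances, sizes):
    """Return (s, p_value) for H0: the r >= 2 entropies are all equal."""
    if len(entropies) < 2:
        raise ValueError("at least two populations are required")
    weights = [n / v for n, v in zip(sizes, variances)]
    v_bar = sum(w * h for w, h in zip(weights, entropies)) / sum(weights)
    s = sum(w * (h - v_bar) ** 2 for w, h in zip(weights, entropies))
    return s, chi2_sf(s, len(entropies) - 1)
```

The null hypothesis is rejected at level $\alpha$ whenever the returned
p-value does not exceed $\alpha$.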

B. Confidence intervals 

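Let $\hat\theta$ be the ML estimate of $\theta$ for a sufficiently
large sample $N$. An approximate confidence interval for
$H^h_\phi(\theta)$ at nominal level $\alpha$ is

$$H^h_\phi(\hat\theta) \pm z_{\alpha/2}\, \frac{\sigma_H(\hat\theta)}{\sqrt{N}}, \tag{7}$$

where $z_{\alpha/2}$ is the $\alpha/2$ quantile of the standard
Gaussian distribution.

Consider now $\hat\theta_1$ and $\hat\theta_2$, ML estimates based on
large samples $N_1$ and $N_2$, respectively. An approximate confidence
interval for $H^h_\phi(\theta_1) - H^h_\phi(\theta_2)$ is given by [34]

$$H^h_\phi(\hat\theta_1) - H^h_\phi(\hat\theta_2) \pm z_{\alpha/2} \sqrt{\frac{\sigma_H^2(\hat\theta_1)}{N_1} + \frac{\sigma_H^2(\hat\theta_2)}{N_2}}.$$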
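
The two intervals of this subsection can be computed as in the sketch
below. It assumes the entropy estimates and their asymptotic variances
are already available; the function names are hypothetical, and the
Gaussian quantile comes from Python's standard `statistics.NormalDist`.

```python
import math
from statistics import NormalDist


def entropy_ci(h_hat, var_hat, n, alpha=0.05):
    """Symmetric asymptotic interval for one entropy (coverage 1 - alpha)."""
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    half_width = z * math.sqrt(var_hat / n)
    return h_hat - half_width, h_hat + half_width


def entropy_diff_ci(h1, var1, n1, h2, var2, n2, alpha=0.05):
    """Symmetric asymptotic interval for the difference of two entropies."""
    z = NormalDist().inv_cdf(1.0 - alpha / 2.0)
    half_width = z * math.sqrt(var1 / n1 + var2 / n2)
    diff = h1 - h2
    return diff - half_width, diff + half_width
```

If the interval for the difference excludes zero, the two samples show
contrast at the chosen level.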



IV. Results 



In the following, we derive results for the Shannon, Rényi, and
restricted Tsallis entropies, denoted as $H_S$, $H_R^\beta$, and
$H_T^\beta$, respectively, under the scaled complex Wishart law. In
particular, these measures are algebraically expressed, numerically
evaluated, and assessed. We adopt
$\theta = [L, \operatorname{vec}(\Sigma)^{T}]^{T}$, where
$\operatorname{vec}(\cdot)$ is the vectorisation operator, as the
working parameter vector. Additionally, asymptotic results for these
measures are computed. To that end, we derive analytic expressions for
the variance of the considered entropies. The entropies were evaluated
at the ML estimate values. Subsequently, hypothesis tests and
confidence intervals are proposed based on Shannon and Rényi entropies.



A. Expressions for the complex Wishart distribution 

We now present three schemes concerning the derivations 
of Shannon, restricted Tsallis, and Renyi entropies. 

1) Shannon entropy: Using the expression of the Shannon entropy
obtained applying $h$ and $\phi$ from Table I to the density given in
Equation (2), we have that

$$H_S(\theta) = -\int_{\mathcal{A}} f_Z(Z'; \Sigma, L) \ln f_Z(Z'; \Sigma, L)\, \mathrm{d}Z' = \mathrm{E}\{-\ln f_Z(Z)\}.$$



Minor manipulations yield the following result:

$$H_S(\theta) = -mL \ln L + (m - L)\, \mathrm{E}\{\ln |Z|\} + L \ln |\Sigma| + \frac{m(m-1)}{2} \ln \pi + \sum_{k=0}^{m-1} \ln \Gamma(L - k) + L\, \mathrm{E}\{\operatorname{tr}(\Sigma^{-1} Z)\}.$$

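In [36], Anfinsen et al. obtain the following identity:

$$\mathrm{E}\{\ln |Z|\} = \ln |\Sigma| + \psi_m^{(0)}(L) - m \ln L,$$

where $\psi_m^{(0)}(\cdot)$ is the term of order zero of the $v$th-order
multivariate polygamma function given by

$$\psi_m^{(v)}(L) = \sum_{i=0}^{m-1} \psi^{(v)}(L - i),$$

and $\psi^{(v)}(\cdot)$ is the ordinary polygamma function expressed by

$$\psi^{(v)}(L) = \frac{\partial^{v+1} \ln \Gamma(L)}{\partial L^{v+1}},$$

for $v \geq 0$ (for $v = 0$, $\psi^{(0)}$ is known as the digamma
function). By the linearity of the expectation operator, the following
holds true: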



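$$\mathrm{E}\{\operatorname{tr}(\Sigma^{-1} Z)\} = \operatorname{tr}(\Sigma^{-1}\, \mathrm{E}\{Z\}) = \operatorname{tr}(I_m) = m,$$

where $I_m$ is the identity matrix of order $m$.

Thus, the Shannon entropy relative to the random variable
$Z \sim \mathcal{W}_m(\Sigma, L)$ is expressed by

$$H_S(\theta) = \frac{m(m-1)}{2} \ln \pi - m^2 \ln L + m \ln |\Sigma| + mL + (m - L)\, \psi_m^{(0)}(L) + \sum_{k=0}^{m-1} \ln \Gamma(L - k). \tag{8}$$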
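
The closed form of the Shannon entropy can be evaluated numerically as
sketched below. The sketch is self-contained and uses only the standard
library, so the digamma function is approximated by an upward
recurrence followed by a short asymptotic series; the function names
and the choice of passing $\ln|\Sigma|$ directly (instead of the
matrix) are assumptions made for brevity.

```python
import math


def digamma(x):
    """Digamma for x > 0: shift x upward, then use the asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # psi(x) ~ ln x - 1/(2x) - 1/(12x^2) + 1/(120x^4) - 1/(252x^6)
    result += (math.log(x) - 0.5 * inv
               - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252)))
    return result


def shannon_entropy(m, L, log_det_sigma):
    """Shannon entropy of the scaled complex Wishart law (needs L > m - 1)."""
    psi_m = sum(digamma(L - k) for k in range(m))
    log_gamma_sum = sum(math.lgamma(L - k) for k in range(m))
    return (0.5 * m * (m - 1) * math.log(math.pi)
            - m * m * math.log(L)
            + m * log_det_sigma
            + m * L
            + (m - L) * psi_m
            + log_gamma_sum)
```

For $m = 1$ the expression reduces to the entropy of a gamma
distribution with shape $L$ and mean $\Sigma$, which gives a convenient
sanity check.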



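2) Restricted Tsallis entropy: Based on Table I, the restricted Tsallis
entropy is defined by

$$H_T^\beta(\theta) = (1 - \beta)^{-1} (\mu_\beta - 1),$$

where $\mu_\beta = \mathrm{E}\{f_Z^{\beta - 1}(Z)\}$ can be explicitly
calculated:

$$\mu_\beta = \int_{\mathcal{A}} \left[\frac{L^{mL} |Z'|^{L-m}}{|\Sigma|^{L}\, \Gamma_m(L)}\right]^{\beta} \exp\left[-\beta L \operatorname{tr}(\Sigma^{-1} Z')\right] \mathrm{d}Z' \tag{9}$$

$$= \left[\frac{L^{mL}}{|\Sigma|^{L}\, \Gamma_m(L)}\right]^{\beta} \int_{\mathcal{A}} |Z'|^{(\beta-1)(L-m)}\, |Z'|^{L-m} \exp\left[-L \operatorname{tr}\big((\Sigma/\beta)^{-1} Z'\big)\right] \mathrm{d}Z'$$

$$= \frac{L^{mL(\beta-1)}\, |\Sigma/\beta|^{L}}{|\Sigma|^{L\beta}\, \Gamma_m(L)^{\beta-1}}\, \mathrm{E}\big\{|X|^{(m-L)(1-\beta)}\big\} = \frac{L^{mL(\beta-1)}\, \beta^{-mL}}{|\Sigma|^{L(\beta-1)}\, \Gamma_m(L)^{\beta-1}}\, \mathrm{E}\big\{|X|^{(1-\beta)(m-L)}\big\}, \tag{10}$$

where $X \sim \mathcal{W}_m(\Sigma/\beta, L)$. Moreover, Anfinsen et
al. [37] showed that

$$\mathrm{E}\{|Z|^{r}\} = \frac{|\Sigma|^{r}\, \Gamma_m(L + r)}{L^{mr}\, \Gamma_m(L)}, \tag{11}$$

where $Z \sim \mathcal{W}_m(\Sigma, L)$. Thus, applying (11) in (10),
we have that

$$\mu_\beta = \frac{\Gamma_m\big(L + (1-\beta)(m-L)\big)\, |\Sigma|^{(1-\beta)m}}{\Gamma_m(L)^{\beta}\, L^{(1-\beta)m^2}\, \beta^{m[L + (1-\beta)(m-L)]}}. \tag{12}$$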



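3) Rényi entropy: From Table I, the Rényi entropy is given by

$$H_R^\beta(\theta) = (1 - \beta)^{-1} \ln \mu_\beta.$$

Notice that this measure also depends on $\mu_\beta$, which was already
computed in Equation (12).

Therefore, denoting $q = L + (1 - \beta)(m - L)$, the Rényi entropy is
expressed by

$$H_R^\beta(\theta) = \frac{m(m-1)}{2} \ln \pi - m^2 \ln L + m \ln |\Sigma| - \frac{mq \ln \beta}{1 - \beta} + \frac{\sum_{i=0}^{m-1} \big[\ln \Gamma(q - i) - \beta \ln \Gamma(L - i)\big]}{1 - \beta}. \tag{13}$$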
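
Both order-$\beta$ entropies depend on the data model only through
$\ln \mu_\beta$, so they can share one helper. The sketch below is a
hedged illustration that uses only the standard library; the function
names are hypothetical, $\ln|\Sigma|$ is passed directly, and the
arguments are assumed to satisfy $q > m - 1$ so that every gamma
function is evaluated at a positive point.

```python
import math


def log_mu(m, L, log_det_sigma, beta):
    """Natural log of mu_beta, the integral of f^beta over the support."""
    q = L + (1.0 - beta) * (m - L)
    log_gamma_q = sum(math.lgamma(q - i) for i in range(m))
    log_gamma_L = sum(math.lgamma(L - i) for i in range(m))
    return ((1.0 - beta) * (0.5 * m * (m - 1) * math.log(math.pi)
                            - m * m * math.log(L)
                            + m * log_det_sigma)
            - m * q * math.log(beta)
            + log_gamma_q - beta * log_gamma_L)


def renyi_entropy(m, L, log_det_sigma, beta):
    """Renyi entropy of order beta (beta > 0, beta != 1)."""
    return log_mu(m, L, log_det_sigma, beta) / (1.0 - beta)


def tsallis_entropy(m, L, log_det_sigma, beta):
    """Restricted Tsallis entropy of order beta (beta > 0, beta != 1)."""
    return math.expm1(log_mu(m, L, log_det_sigma, beta)) / (1.0 - beta)
```

With $m = 1$, $L = 1$, and $\Sigma = 1$ the model is the unit
exponential law, for which $\mu_\beta = 1/\beta$; both functions then
approach the Shannon value 1 as $\beta$ tends to 1.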

It is known that, as $\beta \to 1$, both the Rényi [38, p. 676] and
Tsallis [39] entropies converge to the Shannon entropy. Thus:

$$\lim_{\beta \to 1} H_R^\beta(\theta) = H_S(\theta) = \lim_{\beta \to 1} H_T^\beta(\theta). \tag{14}$$



These convergences hold true regardless of the number of looks.
Moreover, it is important to emphasize that the derived expressions can
be related to the eigenvalues of the covariance matrix. This approach
results in new expressions for $H_S$, $H_T^\beta$, and $H_R^\beta$ in
terms of the geometric and arithmetic means of these eigenvalues as
follows. Let $\lambda_1, \lambda_2, \ldots, \lambda_m$ be the
eigenvalues of the covariance matrix $\Sigma$. Following Mardia et al.
[40], since $|\Sigma| = \prod_{i=1}^{m} \lambda_i$ and
$\operatorname{tr}(\Sigma) = \sum_{i=1}^{m} \lambda_i$, then

$$\frac{|\Sigma|^{1/m}}{\operatorname{tr}(\Sigma)} = \frac{\left(\prod_{i=1}^{m} \lambda_i\right)^{1/m}}{\sum_{i=1}^{m} \lambda_i}.$$

Thus, setting $\Sigma' = \Sigma/\operatorname{tr}(\Sigma)$ and adopting
$\Sigma'$ as the covariance matrix in (8) and (13), we have new
expressions that can be used in place of the ones proposed by Cloude
and Pottier [41] and Yan et al. [42].

In the following, we examine the behavior of $H_S$, $H_T^\beta$, and
$H_R^\beta$ in terms of $\Sigma$ and $L$.

4) Case study:

Frery et al. [22] observed the following covariance matrix on an urban
area from the E-SAR image of Weßling (Bavaria, Germany):

$$\Sigma_U = \begin{bmatrix} 962892 & 19171 - 3579i & -154638 + 191388i \\ & 56707 & -5798 + 16812i \\ & & 472251 \end{bmatrix};$$

only the diagonal and the upper triangle values are shown. Fig. 1(a)
depicts plots of the discussed entropies for $3 < L < 50$,
$\beta \in \{0.1, 0.5, 0.8\}$, and $\Sigma_U$. Considering the same
interval for the number of looks, Figs. 1(b), 1(c), and 1(d) show the
Shannon, Tsallis (of order $\beta = 1 - 10^{-\ldots}$), and Rényi (of
order $\beta = 0.1$) entropies for the covariance matrix
$(1 + k)\Sigma_U$, $k \in \{0, 0.1, 0.2\}$, respectively.

Figs. 1(a) and 1(e) illustrate the property stated in (14). In the case
shown here, the convergences are from above, i.e., $H_S$ is always
smaller than $H_R^\beta$ and $H_T^\beta$.



Figs. 1(b) and 1(d) suggest that multiplying the covariance matrix by a
constant greater than one (hence increasing its determinant) also
increases both Shannon and Rényi entropies. As expected, increasing the
number of looks leads to smaller entropy values due to the increased
SNR.

Although the restricted Tsallis entropy can be used in several fields
of image processing [17], the derivation of its variance does not lead
to a mathematically tractable expression. Thus, henceforth we focus our
attention on the Shannon and Rényi entropies, which allow the necessary
algebraic manipulations for the method described in Section III. In the
next section, we derive asymptotic variances for the Shannon and Rényi
entropy estimates.

B. Asymptotic variances 

Let $Z$ be a random matrix which follows a scaled complex Wishart
distribution with parameter $\theta$ as already defined. Its
log-likelihood is

$$\ell(\theta) = mL \ln L + (L - m) \ln |Z| - L \ln |\Sigma| - \frac{m(m-1)}{2} \ln \pi - \sum_{k=0}^{m-1} \ln \Gamma(L - k) - L \operatorname{tr}(\Sigma^{-1} Z).$$



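Since $\partial \ell(\theta)/\partial \operatorname{vec}(\Sigma) = \operatorname{vec}(\partial \ell(\theta)/\partial \Sigma)$
holds true [43], the score functions are given by

$$\ell_L = m(\ln L + 1) + \ln |Z| - \ln |\Sigma| - \psi_m^{(0)}(L) - \operatorname{tr}(\Sigma^{-1} Z),$$

and

$$\ell_{\operatorname{vec}(\Sigma)} = L \operatorname{vec}\big(\Sigma^{-1} Z \Sigma^{-1} - \Sigma^{-1}\big).$$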

The Hessian matrix $\mathcal{J}(\theta)$, the Fisher information matrix
$\mathcal{K}(\theta)$, and the biased version (according to Anfinsen et
al. [36]) for Cramér-Rao lower bound $\mathcal{C}(\theta)$ are
necessary to obtain closed-form expressions for the asymptotic entropy
variance used in Equation (3). In particular, the following quantity
plays a central role:

$$\mathcal{J}_{\Sigma\Sigma} = \frac{\partial^2 \ell(\theta)}{\partial \operatorname{vec}(\Sigma)\, \partial \operatorname{vec}(\Sigma)^{*}},$$

where $(\cdot)^{*}$ represents complex conjugation. Anfinsen et
al. [36] showed that

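$$T_1 = \frac{\partial (\Sigma^{-1} Z \Sigma^{-1})}{\partial \Sigma} = -\Sigma^{-1} \otimes \Sigma^{-1} Z \Sigma^{-1} - \Sigma^{-1} \otimes \Sigma^{-1} \Sigma^{-1} Z.$$

Moreover, it is known that [43]

$$T_2 = \frac{\partial \Sigma^{-1}}{\partial \Sigma} = -\Sigma^{-1} \otimes \Sigma^{-1}.$$

Thus, we have that

$$\mathcal{J}_{\Sigma\Sigma} = L (T_1 - T_2) = L \big(\Sigma^{-1} \otimes \Sigma^{-1} - \Sigma^{-1} \otimes \Sigma^{-1} Z \Sigma^{-1} - \Sigma^{-1} \otimes \Sigma^{-1} \Sigma^{-1} Z\big). \tag{15}$$

From Equation (15) we obtain:

$$\mathcal{K}_{\Sigma\Sigma} = \mathrm{E}\{-\mathcal{J}_{\Sigma\Sigma}\} = L\, \Sigma^{-1} \otimes \Sigma^{-1}. \tag{16}$$

Fig. 1. Shannon, Tsallis, and Rényi entropies for several covariance
matrices and number of looks. Recoverable panel titles: (d) $H_R^{0.1}$
and $(1 + k)\Sigma$; (e) $H_S$, $H_R^\beta$, $H_T^\beta$, and $\Sigma$.
Horizontal axis: $L$.

The analytical expressions for $\mathcal{J}(\theta)$,
$\mathcal{K}(\theta)$, and $\mathcal{C}(\theta)$ are, thus,

$$\mathcal{J}(\theta) = \begin{bmatrix} \dfrac{m}{L} - \psi_m^{(1)}(L) & \operatorname{vec}(\Sigma^{-1} Z \Sigma^{-1} - \Sigma^{-1})^{T} \\ \operatorname{vec}(\Sigma^{-1} Z \Sigma^{-1} - \Sigma^{-1}) & \mathcal{J}_{\Sigma\Sigma} \end{bmatrix},$$

$$\mathcal{K}(\theta) = \mathrm{E}\{-\mathcal{J}(\theta)\} = \begin{bmatrix} \psi_m^{(1)}(L) - \dfrac{m}{L} & \operatorname{vec}(0_m)^{T} \\ \operatorname{vec}(0_m) & L\, \Sigma^{-1} \otimes \Sigma^{-1} \end{bmatrix},$$

$$\mathcal{C}(\theta) = \mathcal{K}(\theta)^{-1} = \begin{bmatrix} \left(\psi_m^{(1)}(L) - \dfrac{m}{L}\right)^{-1} & \operatorname{vec}(0_m)^{T} \\ \operatorname{vec}(0_m) & L^{-1}\, \Sigma \otimes \Sigma \end{bmatrix},$$

where $0_m$ is the null square matrix of order $m$. The off-diagonal
blocks of $\mathcal{J}(\theta)$ have zero expectation because
$\mathrm{E}\{Z\} = \Sigma$.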

Anfinsen et al. [36] derived the Fisher information matrix for the
unscaled complex Wishart law. That approach found that the parameters
of such distribution are not orthogonal. Based on the block-diagonal
structure of $\mathcal{K}(\theta)$, we conclude that $L$ and $\Sigma$
become orthogonal under the scaling of the complex Wishart law. Among
other benefits, such scaling makes the likelihood equations separable,
as shown in the following.

Consider $\{Z_1, Z_2, \ldots, Z_N\}$ a random sample of size $N$
obtained from $Z \sim \mathcal{W}_m(\Sigma, L)$. Since
$N^{-1} \sum_{k=1}^{N} \nabla \ell_k(\hat\theta) = 0$, we have that

$$\hat\Sigma = \frac{1}{N} \sum_{k=1}^{N} Z_k = \bar{Z}$$

and

$$m \ln \hat{L} + \frac{1}{N} \sum_{k=1}^{N} \ln |Z_k| - \ln |\bar{Z}| - \psi_m^{(0)}(\hat{L}) = 0. \tag{17}$$

Thus, the ML estimator of $\Sigma$ is the sample mean, while $\hat{L}$
is obtained solving the system shown in Equation (17). The
Newton-Raphson iterative method [44] can be used to solve this
nonlinear system.
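
A Newton-Raphson solver for the number of looks can be sketched as
follows. This is an illustration under assumptions rather than the
authors' implementation: the two summary statistics (the mean of
$\ln|Z_k|$ and $\ln|\bar{Z}|$) are assumed to be precomputed, the
digamma and trigamma functions are approximated with standard-library
code, and the names `estimate_looks`, `digamma`, and `trigamma` are
hypothetical.

```python
import math


def digamma(x):
    """Digamma for x > 0 (upward recurrence plus asymptotic series)."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    return (result + math.log(x) - 0.5 * inv
            - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252)))


def trigamma(x):
    """Trigamma for x > 0 (upward recurrence plus asymptotic series)."""
    result = 0.0
    while x < 6.0:
        result += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    return (result + inv + 0.5 * inv2
            + inv * inv2 * (1.0 / 6 - inv2 * (1.0 / 30 - inv2 / 42)))


def estimate_looks(m, mean_log_det, log_det_mean, start=None,
                   tol=1e-10, max_iter=100):
    """Solve m ln L + mean_log_det - log_det_mean - psi_m(L) = 0 for L."""
    L = float(m) if start is None else float(start)
    for _ in range(max_iter):
        g = (m * math.log(L) + mean_log_det - log_det_mean
             - sum(digamma(L - k) for k in range(m)))
        dg = m / L - sum(trigamma(L - k) for k in range(m))
        new_L = L - g / dg
        if new_L <= m - 1:                 # stay inside the domain L > m - 1
            new_L = 0.5 * (L + (m - 1))
        if abs(new_L - L) < tol:
            return new_L
        L = new_L
    return L
```

The default starting point $L = m$ is a design choice of this sketch;
any value above $m - 1$ is admissible.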

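The asymptotic variance given by Equation (3) is determined by
$\mathcal{C}(\theta)$ and by the term [34]

$$\delta = \left[\frac{\partial H^h_\phi(\theta)}{\partial L},\ \operatorname{vec}\left(\frac{\partial H^h_\phi(\theta)}{\partial \Sigma}\right)^{T}\right]^{T}.$$

We denote the resulting $\delta$ as $\delta_S$ and $\delta_{R,\beta}$,
when $H_S$ and $H_R^\beta$ are considered, respectively. We could not
find a closed expression for the variance of the restricted Tsallis
entropy, so it will not be further considered in the remainder of this
work. Analogously, entropy variances are denoted as $\sigma_S^2$ and
$\sigma_{R,\beta}^2$. These quantities are given by expressions
(18)-(21).

(a) Shannon entropy:

$$\delta_S = \begin{bmatrix} (m - L)\, \psi_m^{(1)}(L) + m - \dfrac{m^2}{L} \\ m \operatorname{vec}(\Sigma^{-1}) \end{bmatrix} \tag{18}$$

and

$$\sigma_S^2 = \frac{\left[(m - L)\, \psi_m^{(1)}(L) + m - \dfrac{m^2}{L}\right]^2}{\psi_m^{(1)}(L) - \dfrac{m}{L}} + \frac{m^2}{L} \operatorname{vec}(\Sigma^{-1})^{*} (\Sigma \otimes \Sigma) \operatorname{vec}(\Sigma^{-1}). \tag{19}$$

(b) Rényi entropy:

$$\delta_{R,\beta} = \begin{bmatrix} \dfrac{\beta}{1 - \beta} \left[\psi_m^{(0)}(q) - \psi_m^{(0)}(L) - m \ln \beta\right] - \dfrac{m^2}{L} \\ m \operatorname{vec}(\Sigma^{-1}) \end{bmatrix} \tag{20}$$

and

$$\sigma_{R,\beta}^2 = \frac{\left\{\dfrac{\beta}{1 - \beta} \left[\psi_m^{(0)}(q) - \psi_m^{(0)}(L) - m \ln \beta\right] - \dfrac{m^2}{L}\right\}^2}{\psi_m^{(1)}(L) - \dfrac{m}{L}} + \frac{m^2}{L} \operatorname{vec}(\Sigma^{-1})^{*} (\Sigma \otimes \Sigma) \operatorname{vec}(\Sigma^{-1}). \tag{21}$$

Note that, as expected from Equation (14),
$\lim_{\beta \to 1} \sigma_{R,\beta}^2 = \sigma_S^2$.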
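
The Shannon case of Equation (3) can be evaluated as in the sketch
below. It is a hedged illustration built from the quantities derived in
this section: the derivative of $H_S$ with respect to $L$, the Fisher
information $\psi_m^{(1)}(L) - m/L$ of $L$, and the covariance block
$L^{-1}\, \Sigma \otimes \Sigma$. The quadratic form
$\operatorname{vec}(\Sigma^{-1})^{*} (\Sigma \otimes \Sigma) \operatorname{vec}(\Sigma^{-1})$
is taken as an input; as an assumption of this sketch it defaults to
$m$, its value when $\Sigma$ is real and symmetric. All names are
hypothetical.

```python
import math


def trigamma(x):
    """Trigamma for x > 0 (upward recurrence plus asymptotic series)."""
    result = 0.0
    while x < 6.0:
        result += 1.0 / (x * x)
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    return (result + inv + 0.5 * inv2
            + inv * inv2 * (1.0 / 6 - inv2 * (1.0 / 30 - inv2 / 42)))


def shannon_variance(m, L, quad_form=None):
    """Asymptotic variance of the Shannon entropy estimator (per sample)."""
    psi1 = sum(trigamma(L - k) for k in range(m))
    d_L = (m - L) * psi1 + m - m * m / L      # dH_S/dL
    k_LL = psi1 - m / L                       # Fisher information of L
    if quad_form is None:
        quad_form = float(m)  # ASSUMPTION: exact for real symmetric Sigma
    return d_L ** 2 / k_LL + (m * m / L) * quad_form
```

The returned value plays the role of $\sigma_S^2$; dividing it by the
sample size gives the variance used in the confidence intervals and in
the test statistic of Section III.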



The entropies, along with their variances, can be used as alternative
goodness-of-fit tests to the one proposed in [45] for the Wishart
distribution, specifying $r = 1$ in Equation (4). Cintra et al. [19]
showed that these statistics outperform the Kolmogorov-Smirnov
nonparametric test in intensity SAR imagery. In this paper, we aim to
quantify contrast in PolSAR images and situations with $r = 2$. In the
following, the proposed tests are applied to PolSAR imagery.

V. Applications 

In this section, we combine the entropies and variances 
derived in the previous section to form statistical tests, whose 
performance with respect to test size and power is assessed by 
Monte Carlo experiments. Finally, the discussed methodology 
is applied to real data. 

A. Entropy as a feature for discrimination 



Fig. 2(a) shows the HH band of a PolSAR L-band image over Weßling,
Germany. This image was obtained by the E-SAR sensor [46] with 3.2
equivalent number of looks (ENL), which is a PolSAR parameter
associated with the degree of averaging of SAR measurements during data
formation [36]. Three regions were selected: $A_1$, $A_2$, and $A_3$,
corresponding to areas with strong, moderate, and weak return,
respectively. The image has approximately 3 m resolution.

Table II shows the sample size $N$ and the ML estimates of the scaled
Wishart distribution parameters for each region. The ENL estimates are
close, suggesting similar levels of radar texture in the three regions;
such levels are probably low, since the regions appear to be cropland.
Notice that the smallest estimated number of looks and the highest
value of determinant for the sample covariance matrix belong to region
$A_1$, where the texture is more pronounced. Figs. 2(b)-2(d) present
histograms and fitted densities from the considered data for two
distributions of the scaled complex Wishart family:
$\mathcal{W}(\hat\Sigma, 3.2)$ and $\mathcal{W}_R(\hat\Sigma, \hat{L})$.
Additionally, Table II also presents the Akaike information criterion
(AIC), a measure of the goodness of these fits [47]. In all cases, the
$\mathcal{W}_R$ distribution presented the best fit.

Fig. 2. PolSAR image, selected regions, histograms (dotted curve) and
fitted scaled complex Wishart distributions:
$\mathcal{W}_R(\hat\Sigma, \hat{L})$ (solid curve) and
$\mathcal{W}(\hat\Sigma, 3.2)$ (dashed curve). Panels: (a) E-SAR
intensity image (HH band); (b) data from region $A_1$; (c) data from
region $A_2$; (d) data from region $A_3$.

Table III presents the asymptotic lower and upper bounds for the
Shannon and Rényi entropies at the 95% level of confidence, as computed
according to Equation (7). The intervals are disjoint, suggesting that
the entropy can be used as a feature for region discrimination. The
smallest entropies are associated with regions whose estimated
covariance matrices have small determinants, in accordance with the
case study presented in Section IV-A4.



Noting that (i) the entropy is a measure of randomness, which is
associated with variability, and that (ii) PolSAR areas with high
reflectance are more affected by speckle noise, even those with
negligible texture, it is intuitive that the determinant of the
covariance matrix and the entropy are positively correlated. Moreover,
the expressions of entropies also grow with the determinant through the
term $m \ln |\Sigma|$, i.e., $H_R^\beta, H_S \propto m \ln |\Sigma|$.

Goodman [48] defined the stochastic covariance matrix determinant as a
generalized variance which is associated with the speckle variability,
defined as the effect of the speckle noise resulting from multipath
interference. Additionally, when there is texture variability it is due
to the spatial variability in reflectance, and is associated with the
"heterogeneity", which is the usual measure of the texture level. This
last source of variability can be captured by, for instance, the
roughness parameter of the polarimetric $\mathcal{G}$ law [49], [50].

TABLE II
ML ESTIMATES FOR THE SAMPLES FROM FIGURE 2(a) UNDER THE RELAXED AND
ORIGINAL WISHART DISTRIBUTIONS, AND THEIR AIC VALUES

Region   $N$    $|\hat\Sigma|$   $\hat{L}$   AIC, $\mathcal{W}(\hat\Sigma, 3.2)$   AIC, $\mathcal{W}_R(\hat\Sigma, \hat{L})$
$A_1$    3708   355494.500       1.361       50769.93                              49856.90
$A_2$    2088   3321.241         1.657       18353.35                              17931.51
$A_3$    1079   274.189          2.557       6749.56                               6629.15

TABLE III
ESTIMATED ASYMPTOTIC INTERVALS AT 95% FOR THE SHANNON AND RÉNYI
ENTROPIES OF SAMPLES FROM FIGURE 2(a)

         $H_S$              $H_R^{0.1}$        $H_R^{0.8}$
Region   Lower    Upper     Lower    Upper     Lower    Upper
$A_1$    37.979   38.432    61.083   61.332    44.045   44.364
$A_2$    30.079   30.541    45.563   45.867    36.124   37.049
$A_3$    19.611   19.949    35.000   35.346    20.901   21.230

B. Synthetic Data

We quantify the performance of hypothesis tests based on Shannon and
Rényi entropies with synthetic data following the scaled complex
Wishart distribution, generated as suggested in [51].

The simulation employed the following parameters: (i) $L = 3.2$;
(ii) the estimated covariance matrices from regions $A_1$, $A_2$, and
$A_3$, denoted as $\Sigma_1$, $\Sigma_2$, and $\Sigma_3$, respectively;
(iii) sample sizes $N \in \{9, 49, 81, 121, 400\}$ (i.e., squared
windows of side $\{3, 7, 9, 11, 20\}$ pixels); and (iv) significance
levels $\alpha \in \{1\%, 5\%, 10\%\}$. After generating two samples of
size $N$ from $X \sim \mathcal{W}_m(\Sigma_i, L)$ and
$Y \sim \mathcal{W}_m(\Sigma_j, L)$, we tested the null hypothesis
$\mathcal{H}_0\colon (\Sigma_i, L) = (\Sigma_j, L)$ for
$i, j = 1, 2, 3$. Following [18], we run 5500 replicas of the Monte
Carlo simulation for every case. Empirical test size and power were
employed as figures of merit. These quantities are defined as the
rejection rates of $\mathcal{H}_0$ when this hypothesis is true and is
false (i.e., $\Sigma_i \neq \Sigma_j$), respectively; they are also
called Type I error and true positive rates.

Table IV presents the resulting empirical test sizes. From a purely
statistical viewpoint, the ideal test is the one whose empirical size
is exactly the nominal one for simulated data. Therefore, we conclude
that the Rényi entropy for $\beta = 0.1$ is generally outperformed by
the Shannon and Rényi of order 0.8, except for $N = 9$. The two last
hypothesis tests have empirical test sizes close to the nominal levels
even for small sample sizes.

TABLE IV
EMPIRICAL TEST SIZES BASED ON SHANNON AND RÉNYI (OF ORDER 0.1 AND 0.8)
ENTROPIES FOR REGIONS $A_1$, $A_2$, AND $A_3$ FROM FIGURE 2(a). A GOOD
TEST IS CLOSE TO ITS NOMINAL LEVEL.

        $A_1$                  $A_2$                  $A_3$
$N$     1%     5%     10%      1%     5%     10%      1%     5%     10%

Shannon entropy
9       1.35   6.15   11.76    1.36   6.16   12.02    1.44   6.20   12.05
49      1.38   5.53   10.44    1.38   5.65   10.35    1.35   5.47   10.44
81      1.33   5.51   9.75     1.35   5.47   9.84     1.35   5.36   10.07
121     1.18   5.56   10.58    1.15   5.51   10.51    1.15   5.49   10.44
400     1.35   5.38   10.33    1.38   5.47   10.40    1.40   5.51   10.22

Rényi entropy with $\beta = 0.8$
9       1.38   6.22   11.65    1.44   6.18   11.93    1.45   6.09   11.78
49      1.27   5.62   10.36    1.31   5.65   10.31    1.33   5.38   10.55
81      1.31   5.47   9.76     1.20   5.47   9.91     1.29   5.36   9.95
121     1.05   5.35   10.45    1.15   5.44   10.44    1.13   5.53   10.55
400     1.35   5.27   10.31    1.36   5.29   10.29    1.27   5.33   10.42

Rényi entropy with $\beta = 0.1$
9       0.69   4.04   8.80     0.76   4.15   8.65     0.82   4.04   8.55
49      0.71   3.62   7.82     0.75   3.87   7.89     0.69   3.45   7.73
81      0.53   3.69   7.51     0.56   3.84   7.49     0.55   3.62   7.51
121     0.60   3.73   7.58     0.60   3.40   7.47     0.53   3.58   7.64
400     0.56   3.84   7.65     0.67   3.82   7.64     0.67   3.62   7.65

Samples from different covariance matrices always led to rejecting the
null hypothesis with the proposed statistical tests, thus leading to
tests with unitary empirical test power. Table V shows the mean value
of the statistics

$$\bar{S}(A_i, A_j) = \frac{1}{5500} \sum_{k=1}^{5500} S\big([\hat\Sigma_i, \hat{L}_i]_k, [\hat\Sigma_j, \hat{L}_j]_k\big),$$

where $S$ is $S_S$ or $S_R$ as given in (6) and
$[\hat\Sigma_i, \hat{L}_i]_k$ are the ML estimates based on generated
data from population $A_i$ at the $k$th Monte Carlo replica. The table
also shows the coefficient of variation (CV) of the test statistics
under alternative hypotheses:

$$\mathrm{CV}(A_i, A_j) = \frac{\sqrt{\sum_{k=1}^{5500} \Big[S\big([\hat\Sigma_i, \hat{L}_i]_k, [\hat\Sigma_j, \hat{L}_j]_k\big) - \bar{S}(A_i, A_j)\Big]^2}}{\bar{S}(A_i, A_j)\, \sqrt{5500}}.$$

The largest means $\bar{S}(A_i, A_j)$ correspond to the largest
absolute values $\operatorname{abs}(|\Sigma_i| - |\Sigma_j|)$, leading
to the following relations:

$$\bar{S}(A_1, A_3) > \bar{S}(A_1, A_2) > \bar{S}(A_2, A_3). \tag{22}$$

Shannon test statistics produced the largest test statistics when
contrasting samples from different areas.

TABLE V
MEAN AND COEFFICIENT OF VARIATION UNDER ALTERNATIVE HYPOTHESES OF TEST
STATISTICS BASED ON SHANNON AND RÉNYI (OF ORDER 0.1 AND 0.8) ENTROPIES

        $A_1$-$A_2$           $A_1$-$A_3$           $A_2$-$A_3$
$N$     CV      $\bar{S}$     CV      $\bar{S}$     CV      $\bar{S}$

Shannon entropy
9       20.59   108.16        13.83   248.53        38.42   29.89
49      8.65    576.23        5.82    1329.19       …       …
81      6.70    949.51        4.52    2191.84       …       …
121     5.51    1418.22       3.71    3274.03       …       …
400     3.01    4687.29       2.02    10817.80      …       …

Rényi entropy with $\beta = 0.8$
9       20.56   106.72        13.72   245.22        38.45   29.50
49      8.63    569.69        5.76    1314.11       16.40   154.43
81      6.67    938.93        4.47    2167.39       12.72   253.68
121     5.48    1402.43       3.66    3237.61       10.42   378.93
400     3.00    4635.65       2.00    10698.61      5.71    1251.74

Rényi entropy with $\beta = 0.1$
9       21.70   78.42         14.27   180.12        40.60   21.79
49      9.11    423.29        6.00    976.32        17.44   114.77
81      7.02    698.47        4.63    1612.06       13.50   188.81
121     5.73    1043.26       3.77    2408.63       11.00   282.06
400     3.15    3450.84       2.07    7963.98       6.03    931.79

C. Real Data

Applying the methodology described above to real data, we obtained the
results shown in Fig. 3. To that end, the following steps were
followed:

1: for $j = 1, 2, \ldots, 5500$ do
2:   Extract two regions $U_j$ and $V_j$ from areas $A_1$, $A_2$, and
     $A_3$.
3:   By sampling without replacement, generate two $N$-point vectors
     $u^{(j)}$ and $v^{(j)}$ from $U_j$ and $V_j$, respectively,
     granting $u^{(i)} \neq u^{(j)}$ and $v^{(i)} \neq v^{(j)}$ for
     $i \neq j$.
4:   Estimate the parameter vectors $\hat\theta_1$ and $\hat\theta_2$
     based on $u^{(j)}$ and $v^{(j)}$, respectively.
5:   Compute the decision from Proposition 1 for
     $\alpha = \{1\%, 5\%, 10\%\}$.
6: end for
7: Let $T$ be the number of times that the null hypothesis is rejected.
   Calculate the empirical test size ($\hat\alpha_{1-\alpha}$) and
   power ($1 - \hat\beta_{1-\alpha}$) at level $\alpha$ as

$$\hat\alpha_{1-\alpha} = T/5500, \text{ if } V_j = U_j,$$
$$1 - \hat\beta_{1-\alpha} = T/5500, \text{ if } V_j \neq U_j,$$

where $\hat\beta_{1-\alpha}$ is the Type II error rate; i.e., the
estimate for the probability of not rejecting $\mathcal{H}_0$ when the
null hypothesis is false. We considered $N$ the number of observations
in squared windows of size
$\{3 \times 3, 4 \times 4, \ldots, 23 \times 23\}$.

Fig. 3 shows the empirical test sizes, with the nominal test size
$\alpha$ in dotted line. The behavior of $\hat\alpha_{1-\alpha}$ is
consistent across the three values of $\alpha$: the empirical test size
is larger in Shannon than in Rényi $\beta = 0.8$ which, in turn, is
larger than Rényi $\beta = 0.1$. The fastest convergence to $\alpha$
with respect to the sample size is consistently observed in area $A_3$.
No test attains the nominal size in area $A_1$. This is probably due to
its pronounced brightness and roughness, as noted in Table II.

In the following we select visually similar regions, but with distinct
statistical properties. To that end, Fig. 4(a) shows an EMISAR image of
Foulum (Denmark) obtained with eight nominal looks over agricultural
areas and sub-10 m resolution; three regions were selected for our
study. Table VI presents the sample sizes and the ML estimates, while
Figs. 4(b), 4(c), and 4(d) show empirical densities of different areas
and channels. The empirical densities of $B_2$ and $B_3$ are the
closest for all polarization channels, as well as their estimates of
$L$.

Fig. 3. Empirical test size for different regions at several sample
sizes and nominal levels. Recoverable panel titles: (a) size 1%;
(c) size 10%. Horizontal axis: $N$.

TABLE VI
ML ESTIMATES FOR THE SAMPLES FROM FIGURE 4(a)

Region   $N$    $|\hat\Sigma|$           $\hat{L}$
$B_1$    3192   $1.609 \times 10^{-5}$   6.925
$B_2$    1408   $1.112 \times 10^{-6}$   11.937
$B_3$    1848   $5.814 \times 10^{-7}$   10.752

Fig. 4. PolSAR image, selected regions and their empirical densities in
all polarizations. Panels: (a) EMISAR image; (b) HH channel; (c) HV
channel; (d) VV channel. Horizontal axes of panels (b)-(d): intensity.

We quantify the discrimination between selected regions by means of
$S_S$ and $S_R^{0.1}$, performing steps 1-7 with the Foulum samples.
Since Rényi entropy converges to Shannon when the order tends to 1, we
omit the analysis of $S_R^{0.8}$ because it is similar to $S_S$, as
illustrated in Fig. 5.

Fig. 5. Empirical test power at the level 10%. Horizontal axis: $N$.

Figs. 6(a), 6(b), and 6(c) exhibit the empirical test power. In general
terms, the estimated power increases quickly when sample sizes
increase. The test based on the Shannon entropy outperforms the one
based on Rényi's.

These results suggest that the test statistic in terms of the Shannon
entropy is a more appropriate tool for change and boundary detection in
PolSAR images than the measure based on the Rényi entropy.

VI. Conclusions 

In this paper, closed expressions for the Shannon, Rényi, and Tsallis
entropies have been derived for data distributed according to the
scaled complex Wishart distribution. The variances of the first two
entropies were also derived when parameters are replaced by ML
estimators, leading to the derivation of two test statistics with known
asymptotic distributions. The statistics based on the Tsallis entropy
were not derived since its variance could not be expressed in closed
form.



Fig. 6. Empirical test power for different regions at several sample
sizes and nominal levels; a good test has high power in these cases.
Panels: (a) 1%; (b) 5%; (c) 10%. Horizontal axis: $N$.



New hypothesis tests and confidence intervals based on the
$(h, \phi)$-family of entropies were derived for the scaled complex
Wishart distribution. These new statistical tools provide means for
contrasting $r \geq 2$ samples and verifying if they come from the same
distribution. Such tools were derived by obtaining orthogonal maximum
likelihood estimators for the scaled complex Wishart law, and by
characterizing their asymptotic behavior.

Monte Carlo experiments were employed for quantifying the performance
of the proposed hypothesis tests. The results provided evidence that
the statistic based on the Shannon entropy is the most efficient in
terms of empirical test size and power.

An application to actual data was also considered in order to assess
the performance of the proposed hypothesis tests. The empirical test
sizes observed with real data are relatively high when dealing with
small samples, but they decrease as the sample size increases. As
expected, the tests presented the worst results in the most
heterogeneous situations. Nevertheless, they perform correctly when the
sample size is increased.

The hypothesis test based on the Shannon entropy presented the smallest
empirical test size. All test statistics detected differences among
different regions at the specified levels. This is important for PolSAR
image analysis, such as in boundary detection [22] and change detection
[24].

As an overall conclusion, the Shannon entropy can be safely used for
discriminating areas in PolSAR imagery. Moreover, the Shannon entropy
is not challenged in terms of simplicity either, which consolidates its
position as the preferred entropy measure. However, care must be taken
when small samples are analyzed. Indeed, in this case, the proposed
tests are prone to classifying regions of similar natures as distinct,
i.e., to incurring Type I errors. In practice, however, this issue is
not severe, since PolSAR image processing often handles data with a
large number of pixels.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

Further research will consider models which include heterogeneity [22],
[49], [50], robust, improved and nonparametric inference [52]-[55], and
small samples issues [56].